A Novel Exploration/Exploitation Policy Accelerating Learning in Both Stationary and Non-Stationary Environment Navigation Tasks
نویسندگان
چکیده
منابع مشابه
Non-stationary Subtasks Can Improve Diversity in Stationary Tasks
Low diversity in a genetic algorithm (GA) can cause the search to become stagnant upon reaching a local optimum. To some extent, non-stationary tasks avoid this problem, which would be a desirable feature of GA for stationary tasks as well. With this in mind, we show that several methods of introducing artificial non-stationary elements help to promote diversity in a GA while working on an inhe...
متن کاملAdaptive robot learning in a non-stationary environment
Adaptive control is challenging in real-world applications such as robotics. Learning has to be rapid enough to be performed in real time and to avoid damage to the robot. Models using linear function approximation are interesting in such tasks because they offer rapid learning and have small memory and processing requirements. This makes them suitable as adaptive controllers in nonstationary e...
متن کاملLearning dynamical systems in a stationary environment
We consider the problem of learning the input–output relation of a dynamical system from noisy data. Our method rests on the use of a smooth simultaneous estimator which generalizes the standard empirical estimator. In a stationary environment, our algorithm is shown to select a model which exhibits the Probably Approximately Correct (PAC) property under very mild conditions. This contribution ...
متن کاملNon-Stationary Policy Learning in 2-Player Zero Sum Games
A key challenge in multiagent environments is the construction of agents that are able to learn while acting in the presence of other agents that are simultaneously learning and adapting. These domains require on-line learning methods without the benefit of repeated training examples, as well as the ability to adapt to the evolving behavior of other agents in the environment. The difficulty is ...
متن کاملBargaining in a non-stationary environment
We study an alternating offers bargaining model in which the set of possible utility pairs evolves through time in a non-stationary, but smooth manner. In general, there exists a multiplicity of subgame perfect equilibria. However, we show that in the limit as the time interval between two consecutive offers becomes arbitrarily small, there exists a unique subgame perfect equilibrium. Furthermo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer and Electrical Engineering
سال: 2015
ISSN: 1793-8163
DOI: 10.17706/ijcee.2015.7.3.149-158